59 research outputs found

    Cynomolgus monkey testicular cDNAs for discovery of novel human genes in the human genome sequence

    Get PDF
    BACKGROUND: In order to contribute to the establishment of a complete map of transcribed regions of the human genome, we constructed a testicular cDNA library for the cynomolgus monkey, and attempted to find novel transcripts for identification of their human homologues. RESULT: The full-insert sequences of 512 cDNA clones were determined. Ultimately we found 302 non-redundant cDNAs carrying open reading frames of 300 bp-length or longer. Among them, 89 cDNAs were found not to be annotated previously in the Ensembl human database. After searching against the Ensembl mouse database, we also found 69 putative coding sequences have no homologous cDNAs in the annotated human and mouse genome sequences in Ensembl. We subsequently designed a DNA microarray including 396 non-redundant cDNAs (with and without open reading frames) to examine the expression of the full-sequenced genes. With the testicular probe and a mixture of probes of 10 other tissues, 316 of 332 effective spots showed intense hybridized signals and 75 cDNAs were shown to be expressed very highly in the cynomolgus monkey testis, but not ubiquitously. CONCLUSIONS: In this report, we determined 302 full-insert sequences of cynomolgus monkey cDNAs with enough length of open reading frames to discover novel transcripts as human homologues. Among 302 cDNA sequences, human homologues of 89 cDNAs have not been predicted in the annotated human genome sequence in the Ensembl. Additionally, we identified 75 dominantly expressed genes in testis among the full-sequenced clones by using a DNA microarray. Our cDNA clones and analytical results will be valuable resources for future functional genomic studies

    Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

    Get PDF
    The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology

    Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

    Get PDF
    publication en ligne. Article dans revue scientifique avec comité de lecture. nationale.National audienceThe human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology

    Stefan Müller (ed): Primate cytogenetics

    No full text

    Chromosome Aberration Data

    No full text
    Table 1. Dicentric chromosome scoring data of peripheral blood mononuclear cells: Version 1 (30 November 2010)

    The HOX complex neighbored by the EVX gene, as well as two other homeobox-containing genes, the GBX-class and the EN-class, are located on the same chromosomes 2 and 7 in humans

    Get PDF
    AbstractTwo newly identified human homeobox-containing genes, GBX1 and GBX2, are closely related genes, as are members of the other homeobox genes, EN-1 and EN-2. GBX1 and EN-2 have been mapped to chromosome 7q36. The present study shows that GBX2was mapped to chromosome 2q37. EN-1 was mapped to chromosome 2q14. Moreover, two HOX complexes neighbored by the EVX gene, HOXA and HOXD, are located at chromosome 7p15-p14 and 2q31-q37, respectively. Thus, it is possible that these homeobox genes were linked to each other on an ancestral genome and that the ancestral chromosome segment was duplicated during evolution

    Multiplex Fluorescence In Situ Hybridization Visualizes a Wide Range of Numerical and Structural Chromosome Changes Induced in Cultured Human Lymphocytes by Ionizing Radiation

    No full text
    Multiplex or multicolor fluorescence in situ hybridization (M-FISH) is a molecular cytogenetic technique that permits the simultaneous visualization of all human chromosomes (22 autosomal chromosomes and 2 sex chromosomes) in different colors. This technique facilitates definite karyotype analysis, enabling the unambiguous identification of complex chromosomal rearrangements. The applications of M-FISH for detecting chromosomal changes are manifold. here we used this technique to analyze the induction and persistence of chromosomal aberrations in cultured human peripheral blood lymphocytes following gamma irradiation

    RANDOM EFFECTS IN MEASUREMENT OF RADIATION EXPOSURE BY BIODOSIMETRY

    No full text
    Biodosimetry is one of convenient, cost-effective methods for measurement of radiation exposure. We discuss di-centric analysis, which is an estimation procedure of radiation dosage by counting number of di-centric chromosomes in cells of an exposed individual. In the di-centric analysis we have to make our own standard response curve by using learning data prior to applying test-data. The learning data are constructed by exposing cells to several fixed dosages and counting di-centric chromosomes. One of long-standing problems in di-centric analysis is whether random effects should be accounted in the analysis. In our study, we tried to dissect random effects from the counting data by using counting data constructed by using cells taken from 13 individuals. In di-centric analysis for low dosage we usually assume quadratic response curve, where Poisson intensity of counting per cell is a quadratic function of the dosage. We investigated separation of random effects and fixed effects from the estimated response curves of each individual. We adopted a Bayesian hierarchical model and estimated fixed and random effects by using MCMC. We found random effects are relatively small and thus we can expect fixed effects can be estimated precisely. Our result suggests that we have to prepare better standard response curve by estimating fixed effects by using large learning data.XXVIth International Biometric Conferenc
    corecore